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Abstract 

We propose a new skewness test statistic for normality based on the Pearson 
measure of skewness. We obtain asymptotic first four moments of the nuU distri- 
bution for this statistic by using a computer algebra system and its normalizing 
transformation based on the Johnson Sjj system. Finally the performance of 
the proposed statistic is shown by comparing the powers of several skewness 
test statistics against some alternative hypotheses. 
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1. Introduction 

A test for normality is an essential problem in statistical pra ctice. Earlier 
studies on tests for normality are summarized in Thode I ( 20021 ). Tradition- 



ally, it is common to use skewness and kurtosis statistics \/hi = m^/my^ 
and 62 = 7714 /m|, respectively, where for a random sample (Xi, A2, . . . , A„), 
TUr = {l/n)YJU{X^-X)\ r = 2,3, and 4, and X = (l/n)Er=i^- They 
are used for detecting skew symmetric and heavy or light tails, respectively. The 
Jarque-Be ra test combining ^ Jh^ a nd &2 is well known as an omnibus test for 



normality (^Jargue and Bera 



1987 )). The improved omnibus test was recently 



presented in ,Nakagawa et al. I (|2011l ). The above test statistics are all based 
on sample moments. Further more, the Shapiro-Wilk jW ) test is famous as an 
extension of probability plots (Shapiro and Wilkl(ll965l) ). 



In addi tion, the Lin-Mudho l kar (L M) test is developed for asymmetric al- 
ternatives ( Lin and Mudholker (1980)). The focus of this paper is also on tests 
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that detect skew symmetric. Based on y^bi and 62, we propose a test statistic 

Vbi (62 + 3) 

and called the sample Pearson measure of skewness (spms). The statistic spms 
corresponds to the Pearson measure of skewness defined by 

_ V/3i(/32+3) 
2 (5/32-6/31 -9)' 

where ^y|3l ~ A'3//^2^^ = ^4/^21 ^^'^ f^i' = 2,3,4, /ir denotes the rth 

population moment about the mean. The pms is introduced by Karl Pearson 
considering Pearso n Systems which are di stinguished nine types including nor- 



mal distributions (jStuart and Ord I (|199J)). Under Pearson Systems, the pms 



actually coincides with a measure of skewness 

mean — mode 
a 

where a is the standard deviation. As the mean and the mode coincide in a 
symmetric population, the distance from the mean to the mode can be treated 
as a measure of skewness. 

The first four moments of the null distribution for spms are explained in 
Section [2j Based on the Johnson Su system, we obtain a transformation to 
approximate the normality of the null distribution of spms in Section [3l In 
Section m the performance of spms is shown by comparing the powers of several 
skewness test statistics against six asymmetric alternatives. 

2. Approximate moments of the null distribution for spms 

In this section, we assume that a sample (Xi, X2, ■ ■ ■ , Xn) is drawn from 
a normal population. The spms is invariant under the origin, and the scale 
changes because of the invariance of ^/bi and 62- It is clear that spms is sym- 
metric. Figure [1] shows a histogram of spms with n = 100. 

To find the approximate moments of the distribution for spms, a l arge 
amount of symbolic computation is required. iNiki and Nakagawa I (I1995I) de- 
scribed a method for this purpose as follows: 

(1) Set U — y/n (m2 — 1) , V — y/nm^, W — \/n [m^ — 3) , and expand spms 
as power series in terms of Xjyfn: 



9 , 95 9 3 o 25 o\ / .,,\ 

4 36 4 108 / V / 

We remark that U = 0(1), V = 0(1), and W = 0(1). 
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Figure 1: Histogram of spms with n = 100 (lO** replications) 



(2) Take expectations term by term. Thus, we obtain the first approximate 
moment of spms. 

(3) The second, third, and fourth moments are obtained from spins'^, spms^, 
and spms*, respectively. 

In step (2), each UW^W^ (0 < i + j + fc < 8) is a symmetric polynomial of 
Xi , X2 , . • . , Xn ■ We emphasize that the algorithm for th e chan ge of bases of 
symmetric polynomials described in iNiki and Nakagawa I (I1995I) is applied in 
computing expecta t ions E [U^V^W''] . The algorithm has been implemented in 
REDUCE (|Hearn I (|2004 )) with utility programs. 

Let Xr be the rth (r = 2,4) moment of the distribution for spms drawn from 
a normal population, each having the following form: 



41n 



/6511 



0{r 



(3) 
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A4 = (^^^n-2+414n-3 + 0(n-4). (4) 

Note that all odd moments are zero. From ([3]), spms is asymptotically normally 
distributed with mean and variance 3/(2n). Thus, we obtain 

^ -^4 o on -1 /48544\ _2 /10386704\ _3 ^, 
p2{spms) = = 3+20n jn ^+0{n 4), (5) 

which will be used for obtaining a normalizing transformation. 

3. Transformation 

A normal approximation o f the null distrib ution for spms is obtained using 
a transformation described in Johnson I (Il949l) . given as follows: 



Theorem 1. Assume that the null hypothesis of normality is true and let 

y.f^, (6) 

V A2 



M/2 

Then 



W^ = -l + v/2 {f32{spms)-l), (7) 
^ ' (8) 

(9) 



\/EgW 




(10) 

is approximately a standard normal variate with mean zero and variance unity. 

This transformation is an an alogy of the normali zing transformation of the null 
distribution for y^bi given bv lD'Agostino ( 1970[ ). 



Figure [2] shows a histogram of Z with n — 200 when 10^ random normal 
samples are generated. The dashed line denotes the probability density function 
of the standard normal distribution. Both one-sided and two-sided tests at any 
desired levels of significance can be performed. For example, for a two-sided 
test with a 0.05 level of significance, the null hypothesis is rejected if \Z\ > 1.96. 

Monte Carlo simulations are performed for judging the accuracy of the trans- 
formation. For samples of sizes n = 100, 150, 200, 300, 500, and 1000, 10^ ran- 
dom normal samples are generated. For each sample, the spms statistic is 
computed and classified into the intervals formed by the two-sided tests with 
levels of significance 0.01, 0.05, 0.10, and 0.20. The entries in Table [T] show the 
Monte Carlo relative frequencies for the intervals. For n > 500, the Monte Carlo 
results agree to at least two decimal places for any significance level. Even if 
n = 100, the results are good for 0.01 level of significance. 
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Figure 2: Histogram of Z l|10|l with n = 200 (10^ replications) and the standard normal 
probability density function (dashed curve) 



4. Power study 

This section demonstrates a comparison of the powers of spms with -^/bi, 
W, and LM tests. Monte Carlo simulations are conducted with samples of 
sizes n = 40,50,60,80, and 100 and lO'' rephcations. Two-sided tests with a 
5% significance level are carried out against six alternative distributions: beta 
distributions with two parameters {p,q) — (2,1), (3, 2); gamma distributions 
with shape parameters a = 2,3; Weibull distributions with shape parameters 
a — 2; and log-normal distribution with mean /z = and standard deviation 
cr = 1/2. The first two distributions have light tails and the rest have heavy 
tails. 

The powers of the above four statistics are summarized in Table [5] and Fig- 
ures [3H5] (spms (o), •y6i(A), VF(-I-), and LM{x)). In most cases, the power of 
the spms test is better than that of the ^/bl test. The power of the spms test 
is superior to that of all other tests in the first two cases. The spms test is 
comparable when the alternative distributions have heavy tails. 
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Table 1: Monte Carlo probabilities for various levels of significance using Z mOI I with 10' 
replications per sample size 



Presumed levels 






sample size 






two-sided test 


100 


150 


200 300 


500 


1000 


0.01 


0.0112 


0.0099 


0.0098 0.0098 


0.0102 


0.0100 


0.05 


0.0595 


0.0537 


0.0519 0.0513 


0.0504 


0.0501 


0.10 


0.1207 


0.1097 


0.1057 0.1027 


0.1009 


0.1003 


0.20 


0.2368 


0.2191 


0.2113 0.2052 


0.2019 


0.2004 
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Figure 3: Powers of the tests when the alternative distribution is Beta (o = 2, /9 = 1) 
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Table 2: Monte Carlo powers of some tests for normality, level of significance 0.05, population 
skewness y'/Ji, and kurtosis /32 



Alteriicitivc hypotheses 


V /^l 








V 


SW 


LM 


J— *diCli I C-t ^5 ^ -L J 


—0.57 


2.4 


40 


0.771 


0.257 


0.706 


0.515 








50 


0.894 


0.347 


0.855 


0.647 








60 


0.945 


0.424 


0.925 


0.720 








80 


0.985 


0.561 


0.987 


0.849 








100 


0.995 


0.703 


0.999 


0.920 




-0.29 


2.4 


40 


0.284 


0.048 


0.153 


0.095 








50 


0.362 


0.059 


0.213 


0.141 








60 


0.406 


0.062 


0.259 


0.153 








80 


0.510 


0.081 


0.400 


0.203 








100 


0.600 


0.114 


0.518 


0.263 




U.DO 


o.o 




n 9Q7 


u.oio 


n "^99 
u.ozz 


u.ooy 








50 


0.430 


0.377 


0.420 


0.469 








60 


0.556 


0.432 


0.500 


0.541 








80 


0.728 


0.570 


0.675 


0.673 








100 


0.847 


0.687 


0.789 


0.798 


Gamma (a = 3) 


1.16 


5.0 


40 


0.611 


0.684 


0.708 


0.751 








50 


0.803 


0.778 


0.829 


0.849 








60 


0.900 


0.851 


0.894 


0.912 








80 


0.976 


0.938 


0.968 


0.968 








100 


0.994 


0.978 


0.991 


0.991 


Gamma (a = 2) 


1.41 


6.0 


40 


0.810 


0.823 


0.880 


0.893 








50 


0.935 


0.895 


0.955 


0.950 








60 


0.980 


0.937 


0.981 


0.979 








80 


0.998 


0.988 


0.998 


0.996 








100 


1.000 


0.998 


1.000 


1.000 


log normal (/i = 0, cr = 1/2) 


1.80 


8.9 


40 


0.764 


0.833 


0.846 


0.884 








50 


0.909 


0.908 


0.932 


0.940 








60 


0.963 


0.945 


0.962 


0.975 








80 


0.993 


0.986 


0.992 


0.993 








100 


0.999 


0.996 


0.999 


0.999 
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Figure 4: Powers of the tests when the alternative distribution is Beta (a = 3, /9 = 2) 
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Figure 5: Powers of the tests when the alternative distribution is WeibuU (a = 2) 
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Figure 6: Powers of the tests when the alternative distribution is Gamma (a = 3) 
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Figure 7: Powers of the tests when the alternative distribution is Gamma (a = 2) 
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Figure 8: Powers of the tests when the alternative distribution is log normal (/i = 0, it = 1/2) 
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5. Concluding remarks 

Corresponding to the Pearson measure of skewness pms we propose 
spms (HI) as a skewness test statistic for normality. We obtain the normaUzing 
transformation IjlOp of the null distribution for spms based on the second and 
fourth approximate moments shown in ^ and For a moderate sample size, 
such as n > 200, the transformation is valid in practice, as shown in Table [T] 
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